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1. Introduction and preliminary definitions 

The results presented in this paper find their origin in some recent 
work on sub-Riemannian geometry [5], and are also strongly inspired 
by some ideas developed in the book by L.S. Pontryagin et al. [7]. The 
main purpose is to provide a comprehensive and coordinate-free proof 
of the Maximum Principle and, at the same time, to present a version 
of this principle that may be readily accessible to researchers studying 
the variational approach to dynamical systems subjected to nonholo- 
nomic constraints, also called Vakonomic dynamics. Applications of 
our results can be found, for instance, in sub-Riemannian geometry, 
where the problem of characterizing length-minimizing curves (see [5] 
and references therein) can be solved by means of the Maximum Prin- 
ciple. Also the construction of a Lagrangian and Hamiltonian dynamics 
on Lie-algebroids (see, for instance, [3, 8, 11]) could be tackled using 
the formalism described in the present paper. This will discussed in a 
forthcoming paper. 

For the present paper, we restrict ourselves to control problems 
satisfying strong smoothness conditions and we only consider optimal 
control problems with fixed time. The extension of our results to more 
general optimal control problems is currently under investigation. 

We now first give some preliminary definitions and fix some nota- 
tions. All manifolds considered in this paper are real, finite dimensional 
smooth manifolds without boundary, and by smooth we will always 
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mean of class C°°. The set of (real valued) smooth functions on a 
manifold B will be denoted by C°°{B), the set of smooth vector fields by 
X{B) and the set of smooth one-forms by X*(B). The set of all smooth 
(local or global) sections of an arbitrary fibre bundle t : E B will 
be denoted by r(r). A family V of vector fields on a manifold B is 
said to be everywhere defined if, given any point x G B, there exists an 
element X G X> such that x is contained in the domain of X. 

We now recall the concept of piecewise curve as introduced in [5]. 
First of all, by a curve in an arbitrary manifold B we shall always mean 
a smooth mapping c : I B, with I C M a closed interval, and such 
that c admits a smooth extension to an open interval containing /. A 
mapping c : [a,b] ^ B will be called a piecewise curve in B if there 
exists a finite subdivision ao '■= a < ai < . . . < ai-i < ai := b such 
that the following conditions are fulfilled: 

1. c is left continuous at each point Oj for i = 1, . . . ,i, i.e. lim^_^^- c{t) 
exists and equals c(aj); 

2. lim.^^^+ c{t) is defined for alH = 1, . . . , ^ and lim._^ + c{t) = c(ao) 

i 

(i.e. c is right continuous at qq = a); 

3. for each i = the mapping c* : [aj_i,aj] — > B, defined by 
c*(t) = c(t) for t €]aj_i,ai] and c*(ai_i) = lim. + c{t), is smooth 

i—l 

(i.e. is a curve in B). 

We will also say that the piecewise curve c is "induced by the smooth 
curves c*" . A piecewise curve which is continuous everywhere will sim- 
ply be called a continuous piecewise curve and it corresponds to what 
is usually called a 'piecewise smooth curve' in the literature. For ex- 
ample, consider two smooth curves 7* : [ai-i,ai\ — > B with i = 1,2 
such that 7^(ai) = 7^(01). According to the above definition, the 
curve 7 : [007^2] — ^ B defined by j{t) = j^{t) if t €]ai-i,ai] and 
7(00) = 7^(ao), is a continuous piecewise curve induced by 7^,7^- On 
the other hand, the piecewise curve 7, induced by 7^,7^, provides an 
example of a piecewise curve which, in general, need not be continuous. 

In this paper we will also encounter the notion of piecewise section 
of a bundle fibred over the real line, say ir : B ^ M, the definition 
of which is similar to that definition of a piecewise curve. A smooth 
section a G r(7r), defined on a closed interval / = [a,b], is always 
assumed to be the restriction of a smooth section of vr defined on an 
open interval containing /. Clearly, any section of tt determines a curve 
in B. On the other hand, if 7 : / — > S is a curve in B, then it will 
determine a section of tt iff 7r(7(t)) = t for all tel. We say that 
a : I = [a,b] ^ B is a piecewise section of tt if a is a piecewise curve 
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in B and, in addition, 7r(cr(t)) = t for all tel. Let a' : [aj_i,ai] —^ B, 
with i = !,...,£ and = a < ai < . . . < = b, represent a finite 
number of curves that induce such a piecewise section a. Then, the 
curves a* necessarily satisfy 7r(c7*(i)) = t, which implies that they are 
smooth (local) sections of tt. We then say that the smooth sections cr* 
induce the piecewise section a. A continuous piecewise section o" is a 
piecewise section a : I ^ B such that, in addition, a is a continuous 
mapping. 



2. A geometric framework for control theory 

We can now proceed towards the constrTiction of a differential geomet- 
ric setting for certain control problems. It should be emphasized that, 
although our formulation is not the most general one, if only for the 
rather strong smoothness conditions we impose, it occurs to us that 
there is a sufficiently large and relevant class of control problems that 
fit within the framework described below (see for instance [10] for a 
different approach). 

DEFINITION 1. A geometric control structure is a triple (r, v, p) con- 
sisting of (i) a fibre bundle t : M ^ IR over the real line, where M is 
called the event space, (ii) a fibre bundle u -.U ^ M, called the control 
space, and (Hi) a bundle morphism p : U ^ J^t over the identity on 
M, such that Tifl o p = u. 

In the above, J^r is the first jet bundle of r : M ^ IR, with projections 
Ti : J^T — >■ IR and ri^o '■ J^t M. The typical fibre of M plays the 
role of configuration space and will be denoted by Q. It follows from 
the definition that we have the following commutative diagram: 



T 




Let u denote a (local) section of t o u, i.e. u : I C ]R ^ U with 
T{v{u{t))) = t. With u we can associate a section c of r, called the base 
section of u and defined hy c = v o u. 
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DEFINITION 2. A smooth section u G r(r o jy) is said to be a smooth 
control if p o u = j^c, where c denotes the base section of u and j^c 
its first jet extension. A smooth section c € T{t) is called a smooth 
controlled section if c is the base section of a smooth control u. 

Let {tjX^jU"") denote an adapted coordinate system on U (i.e. adapted 
to both fibrations r and u). The condition for w G r(r o u) to be a 
smooth control is expressed in coordinates as follows: putting u{t) = 
{t, {t),u°-{t)) we must have that p'^{t, x^ {t),u°-{t)) = for all t. 

Note that these equations are in agreement with the definition of a 
control as given in [7, p 56], where M = M x iR" and U is an (open) 
subset oi M X IR^. 

DEFINITION 3. A control u : I = [a,h] ^ U is a piecewise section 

of T o u such that u is induced by a finite number of smooth controls 
and, in addition, its projection v o u is a continuous piecewise section 
ofr. A continuous piecewise section c : I ^ M of t will be called be a 
controlled section if it is the base section of a control. 

In the following, we shall show that one can associate with any sec- 
tion of a vector field on M. These vector fields will generate controls 
in the sense that (segments of) their integral curves can be regarded as 
controlled sections of r. Moreover, we shall see that also the converse 
holds: each controlled section appears to consist of a concatenation of 
integral curves of such vector fields. First, we shall specify what we 
precisely mean by a "concatenation of integral curves" of vector fields. 

Let B denote an arbitrary manifold and consider a finite ordered 
set of, say, i vector fields on B: {Xi, . . . ,Xi), which need not all be 
different. Let {(t>l} denote the flow of X,;. The composite flow $ induced 
by {Xi, . . . , Xi) is then defined as the mapping 

^■.VcM'xB^B: {{te, . . . ,ti),x) ^4^o...o 0,\(x) , 

whose domain is a subset V of x B. For brevity we shall write ^t{x) 
for $((i£, . . . ,ti),x), where T := {t^, . . . ,ti). We shall sometimes refer 
to T as the composite flow parameter. Assume that (ti,x) G Dom(0^) 
and that (t^+i , Q>i^ o...o4,j^ ) (x)) € Dom(^*+i) for i = 1, 1, then 

({ti, . . . ,ti),x) £ Dom(<I>). It can be proven that Dom($)(= V) is an 
open set (which might be empty) and that for each x & B, T i-^ ^t{x) 
is a smooth mapping defined on an open neig hborhood of G iR^. If we 
fix a value T £ of the composite flow parameter, then $7^ : B ^ B 
determines a diffeomorphism defined on an open subset of B. We refer 
to [6] (Appendix 3) for further details on composite flows. 

Fixing again some T = (t^,...,ti) G pri{V) C (with pri the 
projection of V onto M^), we can associate with any x G Dom($r) and 
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with arbitrary oq G iR, a continuous piecewise curve 7 : [oq, oq + + 
. . . + {tiW —fBas follows: putting = oq + Y.j=i l^jl ^-iid sgn(tj) := 
0, +1, —1 depending on whether ij = 0,ti > 0,ti < 0, respectively, let 



lit) = { 



^sgn(ti)(t 



_ao)(-^) te[ao,ai] 



sgn{t2){t-a,)i^h{^)) for te]ai,a2] 

. €gn(t,Kt-a,_,)i- ■ ■ <PtMA^))) for t G ]ae_i,ae], 



For t €]ai-i,ai[ we then have 'y{t) = sgn(ij)X*(7(i)) and, hence, the 
restriction of 7 to ]aj_i,aj[ is an integral curve of Xi, resp. — Xj, for 
ti > 0, resp. U < 0. Note that 7(0^) = $T(a;), i.e. the endpoint of 
7 coincides with the image of x under the composite flow map <I>t. 
If all ti > 0, then we say that 7 is a concatenation of integral curves 
through X associated with $ (or, with the ordered set {X^, ... ,Xi)) and 
corresponding to the value T of the composite flow parameter. Indeed, 
we than have ^{t) = Xi{'y{t)) for any t &]ai-i,ai]. 

Let us now return to the geometric control structure (r, v, p) and 
recall the definition of the total time derivative operator on the first jet 
bundle J^r, denoted by T : J^r — ^ TM. This is the vector field along 
the projection ri^o defined by 



dt 

where c G r(T). Note that TM{T^{jtc)) = c{t) = Ti,o(jt c). Let a be a 
section of u, then p o cr is a section of ri^o and composing it with the 
total time derivative, we obtain a mapping Topoa : M — > TM, which 
is a smooth section of tm- The vector field T o p o c is projectable with 
respect to r, and its projection on M is given by ^, i.e. Tr oT o poa = 
^ o r. This implies that, if {4>s} denotes the flow of T o p o cr and {As} 
the flow of ^ on ifZ (i.e. Xs{t) = t + s), then the equality (r o (j)s)(rn) = 
\s{T{m)) holds for any m £ M and for all s in a neighborhood of 
such that ^s(m) is defined. 

For a given a G let {^5} again denote the flow of the vector 

field TopocT on M. Assume that m G Dom((/>e), for some fixed e > and 
let r(m) = to- Consider then the curve 7(t) = (j)t-tQ{rn) in M, defined 
on I = [io, tQ+e\. From the above we know that T(7(t)) = A(_(„ (to) = t, 
implying that ^ : I ^ M can be regarded as a section of r. Moreover, 
7 is the base section of the section u : I ^ U defined by u{t) = a{^{t)). 
From the definition of u it easily follows that T{p{u(t))) = 7(t) = 
T(jt 7), which is equivalent to p{u{t)) = j}^ and, hence, m is a smooth 
control. We may therefore conclude that (up to a reparameterization) 
any integral curve of X = T o p o cr, with u G r(z/), determines a 
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controlled section. Indeed, if 7 : [a, 6] ^ M is such an integral curve, 
with m = 7(a), then the curve 7' : [T{m),T{m) + b — a] — > M , t 1-^ ^ 
r(m) + a) is a reparametrization of 7, representing a controlled section 
of T. From now on, it will always be tacitly assumed that the integral 
curves 7 of a vector field of the form T o poa will be parameterized in 
this way. 

We now introduce the following everywhere defined family of vector 
fields on M: 

P = {Topocj I Vct € r(i/)}. (1) 

Take some arbitrary sections fjj G T{iy),i = and put Xj : = 

T o p o cjj G P. Then, any concatenation 7 : [09,0^] ^ M of integral 
curves associated to the ordered set {Xg, ... ,Xi), corresponding to a 
value parameter T G 1R\_ of the composite flow parameter (where 1R\_ = 
{{ti, ■ ■ ■ ,ti) \ ti > 0}) and such that = oq + + . . . + i^, determines 
a piecewise controlled section if T(7(ao)) = oq. Indeed, it is an easy 
exercise to see that the piecewise section u induced by Ui{t) = ai{'y{t)), 
for t G [aj_i,aj], controls 7 (we are using here the notations of Section 
1). In the following we prove that the converse also holds, i.e. the base 
section of any control can be regarded as a concatenation of integral 
curves of vector fields belonging to D. We only prove the result for 
smooth controls; the proof for the more general case then easily follows. 

Let u : I ^ U he a, smooth control with base section c := u o u. 
First, assume that the image u{I) is contained in the domain of an 
adapted coordinate chart V oi U with coordinates (t, x*,m"). Consider 
a smooth extension u of u, defined on an open interval / containing I, 
i.e. u : I ^ U is a local section of t o 1/ with u{t) = u{t) for all tel. 
Upon reducing / if necessary, we may always assume that u{I) C V, 
and in terms of the adapted coordinates on V we can then write u{t) = 
{t,x^{t),u°-{t)). We can now define a local section cr of on the open 
subset V = iy{V) fl r-^^) of ^ as follows: a{t,x') = (t, x', «"(*)), 
V(t, x*) G V'. The map poa determines a section of ti^ satisfying 
poa(c{t)) = j^c{t) for any t € I. This implies that c is an integral curve 
of Topocr. In case the image set u{I) is not fully contained in an adapted 
coordinate chart, we can always cover the compact set u{I) with a finite 
number of adapted coordinate charts and choose a subdivision of / such 
that the image of each subinterval is entirely contained in one of these 
coordinate charts. The construction above can then be carried out for 
the restriction of u to each of these sub intervals, and it readily follows 
that the base section c is a concatenation of integral curves of vector 
fields in P. As mentioned above, the extension of this proof to the case 
of general controls is straightforward. Summarizing, we have shown 
that the following property holds. 
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PROPOSITION 4. A continuous piecewise section c : I ^ M is a 
controlled section iff c is a concatenation of integral curves of vector 
fields in V. 

With the family of vector fields X> on M we can associate a 'quasi-order 
relation' R on M (i.e. a reflexive and transitive relation) as follows: R 
is the subset of M x M defined by (m, n) e R if there exists a control 
u : [o, 6] — > [/ such that i'{u{a)) = m and i'{u{b)) = n (we will say that 
'the control u takes m to n'). For brevity we shall also denote (m, n) E R 
hy m ^ n, and if we want to indicate the control u explicitly, wc will 
write m —>■ n. From Proposition 4 it follows that m — > ra iff there exists 
a composite flow $ associated with an ordered set {X£, ... ,Xi), with 
Xi £ T>, such that n = $7-(m) for some T G iR^. For any m G M, the 
subset Rjn C M, defined by 

Rm = {n E M \ m ^ n} , 

is called the set of reachable points from m. 

In the next section we will first show that a quasi-order relation can 
be associated to any everywhere defined family of vector fields on an 
arbitrary manifold, and that the notion of 'set of reachable points' can 
be introduced in this more general setting. We will then investigate 
some properties of a set of reachable points that will play an important 
role in the further treatment. 



3. Some properties of the set of reachable points 

Given an everywhere defined family of vector fields V on an arbitrary 
manifold B one can define a quasi-order relation R on B as follows: for 

x,y € B wc put (.T, y) € i? if there exists a composite flow associated 
with an ordered set (X^, . . . , Xi), Xi G 2?, such that ^t{x) = V for some 

T G ]Rj^. We then also write x — ^ y (or simply x — >■ y). As described 
in the previous section, the concatenation of integral curves through x, 
determined by ^ and T, is a continuous piecewise curve 7 such that 
7(t) G T> for all t where the derivative exists. As in the previous section, 
we can then define the set of reachable points from x <£ B as the subset 
Rx = {y ^ B \ X ^ y}. Note that R^ ^ $ for all x E B, since V is 
assumed to be everywhere defined. 

Let D denote the smallest generalized integrable distribution, gen- 
erated by V (in the sense of H.J. Sussmann, see e.g. [6]) and let us 
denote the leaf of D through a given point x G .B by L^;. Recall that 
Dx is defined as the space spanned by all tangent vectors of the form 
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T^T{y{{^Ty^{x))), for y G X>, $ a composite flow associated with a 
(finite) ordered set of vector fields belonging to V, and T € such 
that X € Im(<l>2^). Then it is a simple exercise to see that Rx C Lx 
for any x G B. li V = —D, then the relation R is symmetric. Indeed, 
if ^t{x) = y, with $ the composite flow determined by {X^, . . . ,Xi) 
and T = {t£,...,ti) G iR^, then {^T)~^{y) = 3; and an elementary 
computation shows that ($r)~^ = ^'t*, where ^' is the composite flow 
corresponding to the ordered set {—Xi, . . . , —Xg) (where, by assump- 
tion, —Xi G T>) and T* = {ti,... ,tc), i.e. we also have y ^ x. In this 
case R determines an equivalence relation for which the equivalence 
classes are precisely the leafs of the foliation of the smallest integrable 
distribution D generated by X>, i.e. Rx = Lx for any x E B. 

REMARK 5. It should be emphasized here that the everywhere de- 
flned family of vector fields (1) associated to a control structure, can 
never be invariant under multiplication by —1 since, by construction, 
each vector field belonging to this V is of the form T o p o a for some 
a G r(i/) and, therefore, projects onto the fixed vector field ^ on 
IR. Moreover, the relation m ^ n is an order relation (i.e. transitive, 
reflexive and not symmetric) since, if m — > n then r(m) < r(n) holds. 

We will now investigate the local structure of the set of reachable 

points Rx for a given x ^ B. For that purpose we will introduce a special 
class of variations of a concatenation of integral curves of vector fields 
in V, connecting x with some y & Rx, such that these variations will 
lead us from x to points in a neighborhood of y that also belong to Rx- 
The following description is merely intended to give a general intuitive 
idea of the kind of variation we have in mind. We will be more specific 
later on. 

Consider the composite flow $ corresponding to an ordered set of, 
say, ^ vector fields in V, and let T G 1R\. be such that ^t{x) = y- Let 
7 : [a, 5] Bhe the concatenation of integral curves induced by $ and 
T, as constructed in the previous section, with 7(a) = x and 7(6) = y. 
Roughly speaking, a variation of 7 consists of a 1-parametcr family of 
continuous piecewise curves 7^ : [a, 6] — > i?, where e varies over an open 
interval containing 0, such that the following conditions are verified: 

1- 70 = 7; 

2. for all e, 7e(a) = x; 

3. for any e > we have that 7^ is a concatenation of integral curves 
of vector fields in T>] 

4. the map e 1— > 7e(^) is a smooth curve through h. 
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The tangent vector to the curve e i— > 7^(6) at e = is called the tangent 
vector to the variation 7^ (note that 7o(fe) = jib) = ^rix) = y). Rather 
than considering all possible variations satisfying the above conditions, 
we will mainly deal with a specific class of variations, to be determined 
below, called single variations. It will be shown that the tangent vectors 
at y to these single variations generate a convex cone in Dy (where we 
recall that D refers to the smallest integrable distribution generated by 
V) and, moreover, we will prove that each vector belonging to this 
cone is in fact a tangent vector to a variation. If we agree to call 
dimension of a cone the dimension of the linear space generated by 
all vectors belonging to the cone, then the main result of this section 
can be summarized as follows: if the dimension of the cone of tangent 
vectors at y to single variations equals the dimension of Dy, say d, 
then there exists a coordinate chart V on the leaf Ly, with y and 
coordinate functions denoted by (x^, . . . such that for any point 
z G y for which > for alH = 1, . . . , d, we have that z € Rx- 

Consider again a concatenation of integral curves 7 : [a, 6] B 
associated with the composite flow <t -.V C. x B B of an ordered 
set of £ vector fields (Xi, . . . ,Xi) in P, and with a given value T G 
of the corresponding composite flow parameter, such that 7(a) = x 
and 7(6) = ^Tix) = y. We now proceed towards the construction of 
what will be called a single variation of 7. Let T = (t^, . . . ,ti) G iR^ 
and put qq = a,a^ = h and Oj = a^-i + ti for i = !,...,£. Choose an 
arbitrary point r E]a,Q, Of] and let Y be any vector field on B such that 
7(t) belongs to the domain of Y. To fix the ideas, let us assume that 
Ui-i < T < Qi. The flow of Y will be denoted by {V's} and, as before, 
denotes the flow of Xj. We can then consider the composite flow 
<!)* : y C iR^+^ X B ^ B, associated with the ordered set of ^ + 2 
vector fields {X^, . . . , Xi,Y, Xi, . . . , Xi). Next, define 



e T*{e) = {te,...,ti+i,ai-T,e,T-ai-i,ti-i,...,ti). 



It is easily seen that there exists an open neighborhood / C iR of 0, such 
that X is contained in the domain of the map ^T*{e) e E I. For 

each e G /, let 7^ denote the concatenation of integral curves through 
X corresponding to $* and T*{e). The following sketch visualizes the 
situation for r G]ai,a2]: 
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The tangent vector to the smooth curve e i— > 7e(fe) = ^^,^^^(x) at 
e = is then given by 



d_ 

de 



where, in order to simpUfy the notations, we have introduced the map- 
ping r$«^ : r^(^)5 ^ TyB, given by 

T^-/{v) = o t4;_\ 0...0 Tcl,i^_Av) , V^; G r^(,)5 . 

Assume now that K G P. Then one can see that the 1-parameter 
family of continuous piecewise curves 7^ satisfies the conditions pro- 
posed above for a variation of 7. 

Next, suppose we take Y = —Xi and r G]ai_i,ai] for some i € 
{1, . . . , i}, then for e > (but sufficiently small) and for any t g]t, r+e], 
the tangent vector % (t) to the concatenation of integral curves through 
X, induced by <I>* and r*(e), in general will not be contained in P 
since —Xi does not have to belong to V. Consequently, if —X^ V, 
the 7g resulting from the choice Y = —Xi is, strictly speaking, not 
a variation in the sense put forward above. However, we can easily 
remedy the situation by constructing a "reduced" composite flow as 
follows. Putting r(e) = (t£, . . . , — e, . . . , ti) G M^, we see that for 
e sufficiently small, is well-defined in a neighborhood of x and, 

moreover, since (t)a^-r ° ^-e ° ^r-ai-i = ^u-e^ follows that ^x*{e) ~ 
$ j;^^^ . The concatenation of integral curves determined by $ and T'(e) 
does verify the conditions for a variation of 7. The tangent vector at 
e = to this "reduced" variation equals 



d_ 

de 



$^(^)(x) = -r$«^(X,(c(r))). 



We have thus shown that if r G]ai_i, Oj], a variation of the given 7 is 
also determined by the ordered set {X^, . . . ,Xi, —Xi, Xi, . . . ,Xi). 

To conclude, if we are given a continuous piecewise curve 7 : [a, b] 
B, with 7(a) = X, such that 7 consists of a concatenation of integral 
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curves determined by the composite flow $ and composite flow parame- 
ter T = . . . , ti) E iR^ of an ordered set of vector fields (X^, . . . , Xi) 
belonging to P, we introduce the following definition. 

DEFINITION 6. A single variation of 7 is a l-parameter family of 
continuous piecewise curves 7^ : [a, b] — > B, passing through x, with 
7o = 7, and such that for each e the corresponding 7^ is the continuous 
piecewise curve determined by the composite flow and composite flow 
parameter T* (e) associated to an ordered set of vector fields of the form 
{Xi,.. .,Xi,Y,Xi,.. .,Xi) for some ie {!,...,£}, withY € VU{-Xi} 
and where T*{e) is given by (2). (We will also briefly refer to 7^ as 'the 
single variation determined by $* and T* (e) '.) 

For later use we introduce the shorthand notation: V^x T) Li 
{—Xi\i = 1,...,^}. Whenever we consider a single variation deter- 
mined by an ordered set {Xi, . . . , Xi,Y, Xi, . . . , Xi) for some Y £ V^x, 
it will always be understood that Y = —Xj can only occur if i = j. 

Given a single variation 7^ of 7, determined by a composite flow ^* 
and composite flow parameter T*(e), one can always obtain a 'new' 
variation by considering a suitable reparameterization e(e'). More pre- 
cisely, let e' e(e') denote a smooth map satisfying e(0) = and 
5 = ^(0) > 0. Then it is not difficult to verify that and T*(e(e')) 
also determine a variation since 6 > implies that, in a neighborhood of 
0, sgn(e) = sgn(e'). The tangent vector to the curve e' ^T*(e(e'))("'') 
at e' = equals S r$^*(y(7(T))). From this one can easily derive that 
any positive multiple of a tangent vector to a single variation is again 
a tangent vector to a (not necessarily single) variation. Note that if 
SY G "D-Xj then 6 r$"^(y(7(T))) is again a tangent vector to a single 
variation. In general, however, if y € T>x, the vector field 6Y need not 
be contained in V^x- All this naturally leads to the following definition. 

DEFINITION 7. Let y € Rx and fix a composite flow <I>, corresponding 
to an ordered set (Xi, . . . , Xi) of vector fields in P, such that ^rix) = y 
for some T G iR^_ . The variational cone at y associated to ^ and T, is 
the cone CyRx{^, T) in TyB consisting of all finite linear combinations, 
with positive coefficients, of tangent vectors to single variations, i. e. 

CyRx{^,T) = { ^<5^r$^f(y,(7(T^))) I y, g v_x,s' > o, 

i=l 

t' e]ao,ae],se IN}. 

If no confusion can arise, we will often drop the explicit reference to 
$ and T and simply denote the variational cone by CyRx- It is easily 
seen that CyRx is a convex set. Indeed if t;,^ G CyRx, then (1 — 
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t)v + tw e CyRx, for any t G [0,1]. As a consequence of the next 
lemma it will be seen that any element of CyRx{^, T) can be regarded 
as a tangent vector to a variation of the continuous piecewise curve 
through X associated with $ and T. First, we introduce an alternative 
notation for composite flows which will sometimes be more convenient, 
in particular when considering compositions of composite flows. 

Let {Z(^, . . . , Zi) denote an ordered family of vector fields on a man- 
ifold with composite flow If {V'^} represents the flow of Zi for 
i = !,...,£, then it will turn out to be convenient to write 'ip^-k. . .-kt/j^ for 
the composite flow whereby it is understood that * ■ ■ .'ktjj^)^ : = 
i/jf^^ o . . . o i/jj^ = for any admissible T = {t£, . . . ,ti). Using this 
notation, we are able to define the composition ^(2) * ^E'(i) of two 
composite flows ^(2),*(i), with = V^^^*^ * . . . * ■^^^^ for i = 1,2, 
as follows 

*(2) ★ ^(1) = V'§) * • • • * ^h) * ^(i) * • • • * ^a) • 

We now have the following result, the proof of which is quite tech- 
nical. As before, we start from a given continuous piecewise curve 

7 : [ao,a^] B, with 7(00) = x, associated to the composite flow 
of an ordered set of i vector fields {X^, . . . , Xi) in I>, and a fixed value 
T of the composite flow parameter. 

LEMMA 8. Consider any finite number of (say, s) tangent vectors to 
single variations of ^, namely vi = T$^i(li(7(r*))), with Yi G 
and G]ao, a^] for i = 1, . . . , s. Then, there exists a composite flow 
associated to i + 2s vector fields, and a smooth mapping T* : JR^ — ^ 
, (eS . . . , e*) ^ r*(e\ . . . , e^) such that: 

1. = $t; 

2. X belongs to the domain of for all {e^, . . . ,e*) in some 
open neighborhood /(^) 0/ (0, . . . , 0) G 

3. for each fixed (e^^, . . . , e*) G with > for all i, the tangent 
vector to the concatenation of integral curves through x determined 
by and T*{e^ , . . . ,e^) is everywhere contained in V (possibly 
after a 'reduction' of ^* in the sense described above) such that, in 
particular, (^"^,(^^1 ^s-^{x) G Rx', 

4- the tangent vector at e = to the curve e ^T*{eS'^ eS")^^^ equals 
6^Vi, for all 5^ G IR (and where the curve is defined on a sufficiently 
small interval such that {eS^, . . . , eS^) G I^^^ ). 
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Proof. Without loss of generality, we may assume that the instants 
are ordered in such a way that T""^ < . . . < r*. Moreover, whenever 
some of the successive r* coincide, the ordering should be such that from 
the corresponding vector fields 1^, those that do not belong to V always 
precede those that do belong to V. More precisely, assume = . . . = 
with 1 < i < j < s, and let r* e]or-i, Ur] for some r £ {1, . . . ,£}. Then 
we require that if = —Xr for some k G {«,••• ,j}, and —Xr ^ T), 
we have k < k' for all those k' G {i, - ■ ■ ,j} for which Yk' G V. Such 
an arrangement can always be achieved by simply taking a suitable 
permutation of the ordered set (Kj, . . . Yj), if necessary. Henceforth, we 
will always assume, for simplicity, that the Fj's already appear in the 
correct ordering. 

For j = let Sj denote the maximum of the set {i | r* G 

]aj-i, a.j]} and put sj = sj^i if {i \ r* E]aj^i, aj]} = and sq = 0. 
The number of r*'s belonging to the j-th subinterval is then given 
by Uj = Sj — Sj-\. Let {V's} denote the flow of Yi (and, as before, 
{0^,} refers to the flow of Xj). Using the 'star' notation introduced 
above, we now consider for each j G {1,...,£}, the composite flow 
^* . ]p{i+2nj xB^ B defined by 

^ _ { (jp -kil)^! -k (jp -kil)^!'^ -k . . .-k (fp ■kij)^^-^^'^ -k (f>> if nj > 0, 
^' if Uj = 0, 

and a mapping T* : IR^^ i-^ iR^+^"^ (where it is understood that if 
Uj = 0, then T* G M): 

{{ttj — T^i ,e^i ,T^i — T^i~^ ,€^3^^ , . . . , 
if Uj > 0, 
(a-j — flj-i) if rij = 0. 

Next, by we denote the 'composition' of all the composite flows 
i.e. $* = $1 ★ . . . ★ Then, $* itself is a composite flow which 
can be evaluated at points of M^^^^ x B. If we define the mapping 
T* -.M' ^ 1R^+'^' by 

T*ie\ . . . , e^) = (T;(e^^-i+\ . . . , e^^), . . . , T^e', • • • , e^')), 

then it is easily seen that (r*(0, ... ,0),x) G Dom($*) and the equa- 
tion y = ^*^*(Q o)(^) liolds. This implies, in particular, that there 

exists an open neighborhood /^''^ of (0, . . . , 0) G iR* for which the map 
(e^, . . . , e*) 1-^ ^T*(ei e')(^^ ^^^^ defined and, hence, (2) holds. Note 
that ^s){x) can still be written as: 

^T*{e\...,e')ix) = (^l)T;(e»^-i+i,...,e''0 ° • • • ° i^l)T*(e\...,en){x). 
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For s = 1 the definitions of $* and T* coincide with those encoun- 
tered in the construction of a single variation. For any (5^, . . . , S'"^) G IR^ 
and e varying over a sufficiently small interval centered at 0, such that 
the image of the map e i— > {e6^, . . . , ed^) is contained in a straight- 
forward, but rather tedious computation shows that the tangent vector 
to the curve e ^T*(ei5i e<5=)(^)' e = 0, equals d^Vi, proving (4). 
It is also easily seen that when putting = for all i, we obtain 
^T*(o) = ^T, proving (1). 

The proof of (3) we will be provided for a particular, simplified case 
from which the idea for the general proof can then be easily deduced. 
Recall that we have chosen the ordering of the r* in such a way that, 
whenever we have a sequence t'', . . . ,{i < j) with r* = r*"*"^ = . . . = 
r-^, those vector fields which belong to the set {— Xi, . . . , —X(} and 
which are not contained in P, always appear before all the Y^i G "D in 
the sequence , . . . , Y^- . Consider now the particular case where oq < 

= = < ai < r^ Yi = V) and Ys, ^3 G ^- Then, 

Therefore, we can define a new composite fiow, associated with vector 
fields in D, by putting $i = (p^ -k ip"^ -k ip'^ -k (j)^ , and a new compos- 
ite flow parameter Ti(e^,e^,e^) = (ai — r^,e^,e^,r^ — oq — e^). Then 
($i)^H,(gi ^2 = £2 for sufficiently small, the com- 

ponents of Ti(e^,e^,e^) are positive, from which (3) readily follows 
for the 'reduced' composite flow $i and the reduced composite fiow 
parameter Ti. A similar reasoning can be applied to the general case, 
which completes the proof of the lemma. □ 

The previous lemma implies, among others, that any v in the cone 
CyRxi^,T) can be regarded as a tangent vector to a variation of 
the continuous piecewise curve 7 through x, determined by $ and T. 
Indeed, by definition of the cone CyRx{^,T) we can always write v 
(in a non-unique way) as t; = Yl'i=i ^^"^i ^or a finite number of tan- 
gent vectors to single variations Vi = T^'^iYi{j{T^)), with 5* > 0. We 
can then associate to these Vi a composite flow and a composite 
fiow parameter r*(e^, . . . , e*), as in the above lemma. Then <I>* and 
e 1-^ T*{e5^, . . . , e^'') determine a one-parameter family of continuous 
piecewise curves satisfying the conditions for a variation of 7. Moreover, 
from the above lemma it follows that the tangent vector to the curve 
e I— > $^,^^^1 at e = precisely equals v, which we wanted to 

demonstrate. 
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Note that CyRx{= CyRx{^,T)) is entirely contained in Dy (with 
D, as before, the smallest generalized integrable distribution generated 
by D). If the dimension of the cone CyRx equals d = dimD^, then 
this is equivalent to saying that the the interior of the convex cone 
CyRx, with respect to the standard vector space topology on Dy, is 
not empty. Indeed, if we have d independent vectors v^, . . . ,v'^ G CyRx, 
then the interior of the simplex in Dy, determined by the ordered set 
{0,v^ , . . . ,v'^), is contained in CyRx- The converse is an immediate 
consequence of the fact that any (nonempty) open ball in a vector 
space spans the full space. 

Before stating the main result of this section, we recall that Ly 
denotes the leaf of D passing through y (and, of course, Ly = Lx). 
Prom the theory of integrable distributions, we know that Ly is an 
immersed submanifold of B whose dimension equals the rank of D at 

y- 

THEOREM 9. Assume that the dimension of the cone CyRx equals 
the dim,ension d of Dy . Then there exists a coordinate chart V on the 
leaf Ly, with y E V and coordinate functions denoted by {x^, . . . ,x'^), 
such that for any point z & V for which x^{z) > for all i = 1, . . . ,d, 
we have that z € Rx- 

Proof. By assumption, the linear space spanned by all elements of 
CyRx equals Dy. We can therefore select a basis {vi, . . . ,Vii} of the 
linear space Dy, with Vi G CyRx for all i. By definition of CyRx, each 
Vi can then be written as a finite linear combination of tangent vectors 
to single variations, i.e. 

Si 

'"i = J2 ^[iff , ^ = 1, • • • , c^, (3) 

i=i 

for some 5^^^ G and where each uj*'' is of the form 

(») 

for some Y^^^ G V^x, T^ij &]o-Q,a£]. Although these decompositions are 
not uniquely determined, for the remainder of the proof we assume 
that for each of the given basis vectors Vi one particular decomposition 
has been singled out, i.e. we make a fixed choice for the wj'^ and for 
the positive real numbers (5^^^ appearing in (3). In total we thus have 

(i) 

s = si + . . . + Sd tangent vectors to single variations Vj which, however, 
need not all be different and/or linearly independent. For convenience, 

we introduce the following ordering: {v^^ , vi]^ , Vi'^ , vi'^) and 
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we denote an arbitrary element of this ordered set by Wa, with a = 
l,...s and such that Wa = Va^ for a = l,...si, Wa = v^-si for 

a = si + 1, . . . , si + S2, etc According to Lemma 8 wc can associate 

to the s tangent vectors to single variations, Wa, a composite flow 
and a map T* : ^ IR^+^^ such that 

1. = 

2. $^.(^1^ G i?^ if all €'>0, 

3. for any fixed {6^,..., 6^) G M^, the tangent vector to the curve 
e H- > at e = equals S'^Wa- 

With the convention that sq := 0, we have for any v G Dy that 

d Si 

V = rvi = XI ^'<^(i)"^so+...+si-i+j e . 

Putting 

we can still write v as 

s 

a=l 

Since the (5^^^ in (3) have been fixed, it follows that all the coefficients (5°, 
appearing in this decomposition of v, are determined unambiguously. 
Therefore, the following mapping is well-defined: 

f:Dy^ JR^+^', V ^ f{v) = T*{6\ ...,6'), 

and, clearly, T is smooth. 

From the properties of and T*, one can further deduce that, on 
a sufficiently small open neighborhood W of the origin in the linear 
space Dy, the mapping given by 

f:W{c Dy)-^B,v^^*~^^^{x) 

is well-defined and smooth. Moreover, by definition of $*, we have that 
/(O) = y and Im f (Z Ly. Let j : Ly ^ B denote the natural inclusion 
and let us write / for /, regarded as a mapping from W into Ly, such 
that the following relation holds: J^o / = /. Since j is an immersion 
and / is smooth, it follows that f : W{c Dy) Ly is smooth. In 
view of the natural identification ToDy = Dy, it is easily proven, using 
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property (3) of and T*, that the tangent map of / at satisfies, for 
any v = G Dy, 

^T*{e5^,...,e5-)i.x) = = V . 

This, in turn, implies that Tq/ : Dy Ty{Ly) = Dy is the identity map 
and, hence, / induces a diffeomorphism from an open neighborhood 
W d W oi Q ^ Dy onto a an open neighborhood V y va. Ly. Hence, 
to each point z & V there corresponds a unique v £ W, with f{v) = z 
and, with respect to the basis {v^ : i = 1, ... ,d} of Dy chosen above, 
we can write v = Pvi. The open set V then becomes the domain of a 
local coordinate chart on Ly, with coordinate functions {i = 1, . . . ,d) 
defined by putting = P. Finally, from property (2) of and T* 

it follows that for those vectors v = Pvi G W for which all T > 0, we 
have z = f{v) G Rx since, in this case, all the coefficients (5" appearing 
in the decomposition v = S'^Wa are also non-negative. This completes 
the proof of the theorem. □ 



Tofiv) 



d 
de 



d 
de 



Observe that the coordinate vector fields on Ly corresponding to the 
special chart constructed in the previous theorem are such that (using 

d 

the notations from the proof of the theorem) -— = To/ff*) = u*. 

y 

This observation will be of use in proving the following result, which is 
a straightforward consequence of Theorem 9. 

CORALLARY 10. Assume that CyRx has a non empty interior with 
respect to the topology of Dy (denoted by mt{CyRx))- Then, for any 
curve e : [0, 1] (L^ =)Ly with 9(0) = y and ^ (9(0) G mt{CyRx) 
there exists an e > such that 9{t') G Rx for < t' < e. 

Proof. As pointed out before, the fact that CyRx has nonempty 
interior implies that the 'dimension' of the cone equals that of Dy and so 
the previous theorem applies. One can always fix a basis Vi in Dy, with 
Vi G CyRx, such that the 6(0) is contained in the interior of the simplex 
spanned by (0,v^ , . . . ,v'^). In particular, this means that ^(0) = k'^Vi 
with all fc* g]0, 1[. Consider the coordinate chart (x^, . . . , x*^) on Ly, in a 
neighborhood of y, associated with the basis v^,. . . ,v'^ as constructed 
in Theorem 9. Note, in passing, that x*(y) = O for all i. Now, since 



d 
dx'' 



= Vi ioT i = 1, . . . , d, and putting 6** = x* o 9, we find that 

y 



d_ 
dt' 



\t') = k\ foT i = l,...,d. 
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This implies that for alH = 1, . . . , d, ^*(0) > and hence, since 9^{0) = 
0, e'^t') > for < < e and e sufficiently small, i.e. x'{e{t')) > for 
i = 1, . . . ,d. According to Theorem 9 this implies that 6{t') € Rx for 
all < t' < e. □ 

To close this section, we return to the framework of a geometric 
control structure. 

The vertical variational cone in a geometric control 
structure 

Let (r, u, p) denote an arbitrary geometric control structure. It is easily 
seen that the previous definitions and results can be applied, in particu- 
lar, to the everywhere defined family of vector fields V = {Topoa \ a e 

T{ij)} on M. Consider a pair (m, n) G M x M such that m — ^ n and 
let CnRmi^,T) denote the associated cone of variations. Since M is 
fibred over the real line, the kernel of the tangent map Tr defines a 
sub-bundle Vt = ker Tr of TM, called the vertical bundle to r. We 
will now define a 'sub-cone' of CnRm which is vertical in the sense that 
it is contained in V^r and which satisfies VnRm C CnRm- 

DEFINITION 11. The vertical variational cone at n, associated to $ 
and T, is given by: 

VnR^{^,T) = {^<5*T$^f (yHc(r^)) " c{r')) 

i=l 

\S'>Q, t' e]ao, ae\,Y' e V,i = 1, . . . , s} 

As for the variational cone, we shall also sometimes simply write VnRm 
if there can be no confusion regarding the related $ and T. 



4. The cost coordinate and optimality 

In this section we give a straightforward application of Corollary 10 
leading to necessary conditions to be satisfied by an optimal control. We 
first specify how the notion of optimality of a control can be formulated 
within the present geometric framework. 

Let (r, u, p) be an arbitrary geometric control structure (with r : 
M ^ M, ly : U ^ M, p : U ^ J^t, as in Definition 2.1) and let L G 
C°°{U) denote a function on the control bundle U . If u : I = [a,b] ^ U 
is a control, then the cost of u with respect to L is defined by 

J{u) = f L{u{t))dt. 

J a 
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If we put m = u{u{a)) and n = i/(ti(6)), we have, with the notations 
from Section 2, that m ^ n and, in particular, n G Rm. We say that 

the control u is optimal if J{v) < J^{u') for any other control u' such 

u' 

that m ^ n. For the further discussion, it will be helpful to introduce 
the following notation: 

Jti 

where ti,t2 € [a,b], with ti < t2- Note that, in this notation, J^{u) = 
^^a,6) rj,^^ function L is sometimes referred to as the cost function. 

DEFINITION 12. A geometric optimal control structure (r, p, L) 
consists of a geometric control structure (r, v, p) and a cost function L. 

We will now show that to every geometric optimal control structure 

(r, p, V, L) one can associate an extended geometric control structure, 
{t,V,p) in which the cost function is incorporated into the bundle map 
p. For that purpose, we first introduce the product space M := M x M, 
the points of which will be denoted by (m, J). For reasons to become 
clear later on, J will be called the cost coordinate. The fibration r of 
M over M induces the fibration T : M ^ M, (m, J) i— > r(m, J) = 
r(m). Next, for the extended control bundle we take U = U x IR, with 
projection onto M given by z7(n, J) = J). Finally, we can define 

a bundle map p : U ^ J^T as follows: J) = {p{u), J,L(u)), where 
we have used the canonical identification between J^r and J^t x IB? 
obtained as follows: given any section c(t) = (c(t), J{t)) of r, we map j/c 
onto {ilc, J{t), J{t)). Note that ri^o(p(^^) <^)) = i^{u, J) and, therefore, 
{t^V,])) is indeed a well-defined geometric control structure. 

Next, we shall prove that any control defined on a geometric optimal 
control structure (r, z/, p, L) indTicos a control on the extended structure 
(r, 17, p), and vice versa. Let u : I — [a.h] ^ U he a, control related to 
{t,u,p,L), with z/(?x(a)) = m and j/(n(6)) = n. We shall construct a 
control u in the associated structure (r , V, p) such that for any Jq E M 

we have {m^Jo) A (n, Jq + Ju""'^^)- More precisely, define the map 
M :/—>[/ by putting 

«(t) = Kt),Jo + j7-i«'*)). 

It is easily seen that u determines a piecewise section of r o z7 whose 
projection onto M is a continuous piecewise section. Furthermore, the 

first-order jet of the base section V ou equals o u, Jq + J/i"'*'') = 
[jliyou), Jo+ Ju"''^\ L{u{t))) . Since n is a control, we readily obtain the 
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equality pou = (fou), which impHes that u is indeed a control. On the 
other hand, the projections of u(a) and u{b) onto M are given by (m, Jq) 

and (n, Jq + J'{u)), respectively. It follows that (m, Jq) ~> (n, Jo + J^{u)) 
for the extended geometric control problem (and for arbitrary Jq G M) . 

Conversely, let u : [a,b] U, t <—>■ u{t) = {u{t),J{t)) represent a 
control on the extended geometric control structure {t,V,p). Then, if 
the base section is written as {Vou)(t) = c{t) = (c{t),J{t)) we can 
deduce from p ou = j^c that (p o u){t) = jjc, i.e. u : [a,b] U is 
a control. Moreover, the cost coordinate satisfies J{t) = L{u(t)) and, 
hence, 

Jit) = J {a) + / L{u(t))dt = J {a) + Jt*\ 

J a 

In particular, we have J{b) = J {a) + J^'^^ . 

Summarizing the preceding discussion, we have proven the following 
result. 



PROPOSITION 13. Let (r, p, L) denote a geometric optimal control 
structure. Then for any m,n E M and J^, J„ G IR, we have that m ^ n 

and J{u) = Jn — Jm for some control u iff (m, Jm) {n, Jn) in the 
associated extended geometric control structure, where J : [a, 6] — >• iR is 

given by J (t) = Jm + Ju^'^^ ■ 

Consider once more an arbitrary geometric optimal control structure 
(r, v, p, L) and assume m —>■ n for some control u. According to the 
previous proposition we then know that, for any Jq G JR, one can define 

an appropriate function J{t) such that (m, Jq) ^ (n, Jq + J{u)). Let 
c = V o [u, J) be the base section of the control {u, J) . On M we can 
then consider the variational cone C(n,Jo+j{M))^{m.,7())7 resp. the vertical 
variational cone V{n,Jo+j{u))R{m,,Jo)i associated to a composite flow $ 
and composite flow parameter T determining the controlled section c, 
with $^(m, Jo) = (n, Jo+J{u)). The proof of the following proposition 
relies on Corollary 10. 

PROPOSITION 14. Let (r, i/, p, L) denote a geometric optimal control 
structure and assume m ^ n for a control u which is optimal. Then 
the interior of C(^n,Jo+J(u))R{m,Jo) does not contain the tangent vector 

^■^ {n,jQ+J{u))' 

Proof Assume that i-d/dJ)^n,Jo+J{u)) € mt{C(^ri,Jo+J{u))R(jn,Jo))- 
Consider the 'vertical' curve 6{t) = (n, Jq + Jiu) — t) in M, de- 
fined for t G [0,1], whose tangent vector at t = precisely equals 
{—d/dJ)(^n,Jo+J{u))- Pfom Corollary 10 it then follows that there exists 
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an e > 0, sufficiently small, such that 9{t) G R(m,Jo) * ^ [0;^]. 
Prom this, one can deduce that there exists a control u' for which 

(m, Jo) ^ (n, Jo + J^{u) — e). In view of previous considerations, this 
further implies that there exists a control u' on (r, u, p, L) such that 

m ^ n, with cost J{u') = J{u) — e, J{u') < J{u). Since u was 
assumed to be optimal, this clearly leads to a contradiction. □ 



Before proceeding, we first recall some properties and terminology re- 
garding linear spaces and convex cones in a linear space. Let V be an 
arbitrary (finite dimensional) linear space. A hyperplane in V (i.e. a 
linear subspace of co-dimension one) can always be defined as the set 
of all vectors w G V satisfying (77, = for some (non-zero) co-vector 
r/ G V*. Such a hyperplane divides V into two 'half-spaces' which are 
given by the set of all v such that {ri,v) < 0, resp. {r],v) > 0, and 
which are called the 'negative' half-space and the 'positive' half-space, 
respectively. If C is a convex cone in V which docs not span the full 
space, then there always exists a hyperplane such that C is contained 
in one of the corresponding half-spaces. 

If we now return to the situation described in the previous propo- 
sition, it follows from the above considerations that, under the con- 
ditions of Proposition 4.3, there exists a hyperplane in the tangent 
space T(^n,Jo+J{u))M such that the variational cone C(^n,Jo+J{u))R(rn,.h) 
is contained in, say, the corresponding negative half-plane, whereas the 
vector {—d/dJ)i^n.Ja+j{u)) belongs to the positive half-plane. Prom the 
fact that the vertical variational cone y{n,jQ+j{u))R{rn,Jo) is a subset 
of C(^n,Jo+j{u))R(m,Jo), contained in the vertical subspace V{n,Jo+j(u)yT, 
the following result is a straightforward consequence of Proposition 4.3. 

CORALLARY 15. If m ^ n and if u is optimal, then there exists a 
hyperplane in V(^n,Jo+J{u))T^, determined by some rj G V^^^j^^j^^^^^T (the 
dual space of the vertical tangent space V(n,Jo+j{u))T^) such that 

1- iv,- Wt )> 0, and 

2. {r],v) < for all v G V^n,Jo+J{u))R{m,Jo)- 

In order to relate the previous result to a more familiar formulation 
of the necessary conditions for an optimal control, in terms of solutions 
of differential equations, we will need a minor generalization of the 
theory of connections over a bundle map as developed, for instance, in 
[1]- 



maxacta.tex; 1/02/2008; 11:39; p. 21 



22 



5. Lifts over bundle maps 

For the sake of completeness, we first briefiy recall the setting for 
defining a lift over a bundle map. 

Consider a smooth manifold B and a fibre bundle v : N ^ B, 
equipped with a bundle map A : ^ TB fibred over the identity, as 
shown in the following commutative diagram. 

A 




Note that, unlike the treatment in [1], we do not require A'' to be a 
vector bundle. Next, let tt : E ^ B denote an arbitrary fibre bundle 
over B and consider the pull-back bundle Tr*N. We can then define the 
following notion of lift. 

DEFINITION 16. A hft over A is a bundle map h : tt*N TE fibred 
over the identity on E such that the following diagram commutes: 



h 



TT*N 



TE 




A lift h over A allows us to define the h-lift of a section s of More 
precisely, the /i-lift of s G r(z^) is a section of te defined by s''(e) = 
/i(e, s(7r(e))), for all e E E. Note that s'* determines a vector field on 
E. 

A A-admissible curve c : I = [a,b] N is a smooth curve such that 
the base curve voc = cm. B satisfies c{t) = A(c(i)). If we assume that 
A(n) 7^ for all n (z N, then any A-admissible curve is a concatenation 
of integral curves of vector fields belonging to the family V = {Aos \ s G 
r(z/)}. Indeed, let c : I — > denote a A-admissible curve, with base 
curve c. Then c{t) ^ for all t, i.e. c is an immersion. Following an 
argument of S. Helgason (see [2, p 28]), one can prove that there exists 
a finite subdivision {/j} of / such that for the restriction of c to each 
of these subintervals ij there exists a local section of v verifying 
Si(c(i)) = c(i) for all t G ij. It is easily seen that C|7. is an integral 
curve of A o Sj. 
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REMARK 17. We can apply all this to a geometric control structure 
(r, f, p), where wc take B = M, N = U,K = 'Top. K control can then 
be equivalently characterized as a (T o /?)-admissible curve u : I ^ U, 
with the additional constraint that it should be a section of r o i/, 
i.e.(r o u){t) = t for all t. We also recover here the property that each 
(T o p)-admissible curve is a concatenation of integral curves of vector 
fields in V. 

Assume now that the bundle £" is a vector bundle and let A be 
the dilation vector field on E, with flow {St}. A lift h over A is then 
said to be linear if T5t o h{n, e) = /i(n, 5t{e)) for any t. Consider bundle 
adapted coordinate charts on N and E, denoted by (x*, n") and (x*, e"^), 
respectively. In coordinates, h then reads 



and /i is a linear lift iff T^{x^ ,n" ^e"^) = T^{x^ ,n")e^ . The functions 
are called the coefficients of h. For the remainder of this section, 
we always take £^ to be a vector bundle (over B). 

Given a linear lift h and a A-admissible curve c : [a, b] N, with 
base curve c, take any e E E such that 7r(e) = c(a). We can then 
construct a curve c'* in E through e, called the h-lift of c, which is 
uniquely determined by the differential equation h{c^ (t) , c{t)) = c^{t), 
with initial condition c^{a) = e (see also [1]). 

Next, we show that a linear lift h always induces a derivative oper- 
ator V, acting on sections of tt. Let 7r2 : Vtt = E Xb E ^ E denote the 
projection onto the second factor, then, in analogy with the case where 
A*" is a vector bundle and A a linear bundle map (see [1]), we can define a 
mapping K : A*TE E according to: K{n, w) = 7r2{w — h(TE{w),n)). 
Given any x e B, n e A^(= i'~^{x)) and any local section ijj G r(7r), 
defined on an open neighborhood of x, we put 



Clearly, G E-,{= 7r-^(x)). The map Vn thus defined, is a derivative 

operator on r(7r) since, for arbitrary / € C°°{B), V'i)V'2 & r(7r) (all at 
least defined on a neighborhood of x) we find that 



An operator on r(7r) satisfying these properties is called a A -derivative. 
Given any section s G r(i/), we can define the operator on r(7r) by 



/i(x^n",e^ 



) = A^■(a;^n°)— +T\x\n-,e^) 



d 




Vnf^ = A(n)(/)VX.x) + /(x)VnV, 

V„(V'l + V'2) = VnV'l + V„V'2- 
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and, obviously, Vstp is again a section of tt. It is easily seen that there 
is a one-to-one correspondence between A-dcrivatives and linear lifts 
over A. Using the above coordinate expression for h, we obtain that 
the A-derivative determined by h locally reads (for n = (x*,n") G N^) 

(Vn^)^ = A^■(a:^n")^(x^) - n")^^(x^). 
It also follows that VsV' = for s € r(z^) and tp G r(7r) iff 

for all X G -B. 

Similar to what we have in standard connection theory, a derivative 
operator can be constructed which acts on sections of tt defined along 
the base curve of a A-admissible curve c : I = [a,b] ^ N. Indeed, 
consider a curve m E, : I ^ E, such that vr o ■;/; = o c(= c), then 
the A-derivative associated to the linear lift h and acting on ^ equals 

Vc^(i) :=K{c{t),i;{t)). 

It is not difficult to prove that Vcipit) = for all t iS ij^ = c^. If 
VcV' = 0, we say that -0 is h-transported along c and that ip{b) is the 
h-transport of i^{a) along c. We conclude by pointing out that any A- 
admissible curve c in iV determines a linear map : -©^(a) — > -Ec(b)5 
called the h-transport operator along c, defined by c^{e) = ij){b), where 
'ip is the unique solution of the equation Vc''P(t) = with ?/;(a) = e. 



6. The control lift and control derivative 

Let (r, u, p) denote a geometric control structure. Consider the first- 
order jet bundle j'^u of the bundle u : U ^ M, with associated 
projections vi : J^v ^ M, ui^ : J^u ^ U. Recall that for any two local 
sections a and a' of v, defined on a neighborhood of a point m G M, 
we have that j^a = jj^a' G J^v iff a{m) = a'{m) and T^cr = T^cr' (as 
linear maps from T^M into T^(^)J7). Bearing this in mind, it is easily 
seen that the following mapping is well-defined: 

A : ^ TU, jla ^ A(i^a) = r^a((T o p){a{m))). (4) 

Moreover, A is a bundle map over the identity on U. In terms of appro- 
priate bundle coordinates (t, a;*,u") on U and {t,x^,u"',uf,uf) on J^u, 
A reads 

A{t,x\u'^,u^,ut) = {t,x\u^l,pi{t,x\u^),4 + pi{t,x\u'^)u'j)). 
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We now consider the fibred product bundle U XmVt, with projections 
pi : U Xm Vt U, {u,v) I— > pi{u,v) = u and p2 : U Xm Vt ^ 
Vt, {u,v) ^ V, whereby op^ = tm ° P2- Observing that pi : U Xm 
Vt — > [/ is a vector bundle over U , we can apply the theory from the 
previous section to the case where B = U, N = J^u, E = UxmVt and 
A is given by (4). It will be seen that, within this setting, A-admissiblc 
curves are closely related to controls. For that purpose, we need the 
following straightforward extension of the definition of A-admissible 
curve to the class of piecewise curves: a piecewise curve ij^'iaJ^v is said 
to be A-admissiblc if it is induced by (i.e. consists of a concatenation 
of) a finite number of smooth A-admissible curves. 

In the sequel, we always assume that a piecewise A-admissible curve 
Tp in J^v has a continuous projection onto M and is parameterized such 
that T{i'i{%lj{t))) = t, i.e. such that ip isa section of tovi. (Note that this 
is not a restriction since, given any A-admissiblc curve ip : [a, b] J^v, 
with ta = i'i{tpiO'))j we can consider a rcparametrization of tp according 
to ip' : [ta, ta + b-a] ^ J'^v, t ^ ip'{t) = ipit -ta + a). Then, -0' is still 
A-admissible and, moreover, satisfies t(z/i (■;/''(*))) = t.) 

LEMMA 18. The projection onto U of any A-admissible curve in J^v 
is a smooth control, and any control in U can be obtained as the pro- 
jection of a piecewise A-admissible curve. 

Proof. We first prove that the projection u = vi^ o i/; of a A- 
admissible curve ijj : [a,b] ^ J^v is a smooth control. By definition, 
we have u{t) = A(^(t)). From Ti^ o A = T o p o vi^, it follows that 
c(t) = (T o p)(u(t)), where c = i'i('ip{t)) = v o z^i^o(V'(0)- This shows 
that the smooth curve u is (T o p)-admissible, i.e. it is a smooth control. 

On the other hand, assume that u : [aojC^^] U is a control, 
with base curve c = v o u. We then know that c can be written as 
a concatenation of integral curves, induced by the composite flow $ 
of an ordered set (T o p o cr£, . . . , T o p o a\) for some cxj G r(z/), with 
composite flow parameter T = {ai — a^_i, . . . , ai — oq). Furthermore, 
u{t) = ai{c{t)) for any t G]aj_i,aj]. Putting V'i(i) = j^(^i{c{t)) for all 
t G [ai_i,ai] and z = 1, . . . we obtain that for any t G]ai_i,ai] the 
equality 



holds. Therefore, according to the definition above, the piecewise curve 
if) : [ao,ai] — > J^v, induced by the smooth curves V'i : [oi-ijOi] — > J^^^, 
is a piecewise A-admissible curve, which completes the proof of the 




lemma. 



□ 
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In the following we shall frequently make use of the natural identifica- 
tion T{U xm Vt) = TU Xtm T{Vt), without mentioning it explicitly. 
We further denote by 5 : TTM TTM the canonical involution on 
TTM. The latter is characterized by the relations Ttm os = ttm and 
TTM 05 = Ttm- 

REMARK 19. Recall that, given an arbitrary manifold B with local 
coordinates (x*), and denoting the natural bundle coordinates on TB 

and TTB by (,t*, v^) and (x*, v'^,x'^, v^), respectively, then the canonical 
involution s on TTB reads S {x'^,v\x\v^) = {x\x\ v\v^). 

For a geometric control structure (r, u, p), with bundle map A given 
by (4), we have the following property. 

PROPOSITION 20. The map h" : uIq{U XmVt) T{U Xm Vt), 
defined by 

h'^ij^a, ia{m),v)) = (A(j»,S (r(T o p)(r„c7(^)))), 

for any m G M, a G r(i/) and v G VmT{C T^M), is a linear lift over 
A. 

Proof. We first verify that h'^ indeed takes values in T(U Xm Vt). 
For that purpose, consider bundle adapted coordinates (t, and 
{t,x\v^) on U and Vt, respectively. Take m = (t, x*) G M, v = 
{t,x^,v^) G VmT and j^a = {t,x^,u°-,uf,uf) G J^u, then: 



Next, using the properties of the canonical involution operator 5, and 
taking into account (4), it is easily seen that 



Tu{A{jl^a)) = Ttm{s (r(T o p)(r„a(i;)))) = (T o p){a{m)) G T^M 



which proves indeed that Im h'^ C T{U x m Vt). 

From its definition it readily follows that is a bundle map fibred 
over the identity on U Xm Vt, and we have that 



s{T{Top){T^a{v))) = - +p%t,x\u-) -- 



V 




From this expression one can read that 



S (r(T o p){TrMX))) G n{VT){c %TM). 



Tpi{h%jla,{a{m),X))) = AUla). 
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This already guaranties that h"^ is a hft over A in the sense of Definition 
5.1. From the above coordinate expression wc can also deduce that the 
(J/Sw'^-components of h'^ are linear in the fibre coordinates v'' of 
the vector bundle pi. More precisely, we have T^(t,x^ ,v\u"',Uj) = 
with 

This shows, in particular, that h'^ is a linear lift. □ 

Note that the 'coefficients' of h'^ do not depend on the coordinates 
uf of J^z^. In a remark at the end of this section we will return to this 
point in more detail. 

Let us denote the A-derivative corresponding to h*^ by D and let 
V(z^) denote the set of r-vertical vector fields along v, i.e. 

V{u) = {Z -.U ^Vt\ tm{Z{u)) = u{u), for all u G U}. 

Note that, in view of the relation I'opi = tm°P2, we have V(z^) = r(pi). 

PROPOSITION 21. Given any j^a G J^iy and Z e V{iy), then Dji^^Z 

is contained in VmT and 

Dji^^Z = [Topoa,Zoa]{m), 

(where the square brackets on the right-hand side denote the ordinary 
Lie bracket of vector fields on M). 

Proof. Recalling the coordinate expression of a A-derivative (cf. 
Section 5), we obtain, with a slight abuse of notation. 

The result then easily follows upon substituting u^^ = and = 

in the right-hand side, and comparing this with the coordinate 
expression of the Lie bracket [T o p o a, Z o a]{m). □ 

We shall now derive an explicit expression for the /i '^-transport operator 
i/j^ determined by a A-admissible curve i/j : I = [a,b] ^ J^u. We first 
consider the case where ip takes the special form ^p{t) = j^a(c(t)) for 
some curve c : [a,b] M and a section a G P(z/). Note that such aip is 
A-admissible iff u{t) := cr(c(i)) is a smooth control, which still implies 
that in terms of the flow {4>s} of the vector field T o p o a, we have 
c{t) = (f)t-a{c{a)). 
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LEMMA 22. Let tp : [a,b] J^u, t ^ ipH) = j^a{c{t)) be a A- 
admissible curve, and let {4>s} denote the flow of T o p o a. Then 
the h'^ -transport operator i/j^ : V^c(o)''" ~^ ^c{b)'^ along if) is given by 

Proof. Representing the flow of ^ on iR by {A^}, it immediately 
foflows from t o cpg = o r that, for any v € Vt, the vector T(f)s{v) 
also belongs to Vt. In particular, we have T(f)ij_a{v) € Vc(h)T. 

Next, take vq G ^c{a) let X{t) denote the section of Vt along 
c{t) which is uniquely determined by the conditions D^{u,X){t) = 
and X{a) = vq. This is still equivalent to 



dt 



X(i)=5(r(Top)T,(,)C7(X(t))). (5) 



Since s (r(T o p)T^^^)a{X{t))) = (T o p o aY{X{t)), where (T o p o af 
denotes the complete lift of the vector field T o p o cr to TM, (5) tells 
us that X{t) is an integral curve of (T o poaY, passing through vq. By 
construction of the complete lift of a vector field, the flow of {TopoaY is 
given by {T(j)s} and, therefore, X{t) = T(j)t-a{X{a)). The result then 
follows immediately from the definition of the /i '^-transport operator 
along ip. □ 

Next, we consider the case where ijj : [a, 6] .J^u is a. pieccwisc A- 
admissible curve whose projection c = z^i o onto M is continuous. 
Recall, in particular, that u{t) := vi^Q{ip{t)) is a control (see Lemma 
18). For the sequel we will need an extension of the definition of the 
A-derivative corresponding to to piecewise curves. For that purpose, 
let X : [a, b] — > Vt be a continuous piecewise curve projecting onto the 
base curve c{t) of ip. Note, in particular, that (u, X) represents a piece- 
wise section of U Xm Vt along c in M. From the definition of piecewise 
curves it can be deduced that one can always find a sufficiently fine 
subdivision oq = a < oi . . . < = 6 of the given interval [a, 6] such 
that ip can be written as a concatenation of I smooth A-admissible 
curves ipi : [aj_i,aj] J^i/ and X as a concatenation of i smooth 
curves Xi : [a^-i, Oj] Vt. For the piecewise A-admissible curve ijj we 
now define the A-derivative D^, acting on the piecewise section {u,X), 
as follows: 

D^{u,X)(t) := D^^{u, Xi)(t) for alH G]ai_i, Oj], i = !,...,£, 

and 

D^{u,X){ao) = D^^{u,Xi){ao). 

It is easily seen in coordinates, for instance, that the mapping D^{u, X) 
from [a, b] to U x^mVt is indeed well defined. Given any vq G V^(o)''") o^ie 
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can readily verify that there exists a unique continuous piecewise curve 
X{t) in Vt such that D^{u,X){t) = for ah t G [a,b], with X{a) = 
vq. This impHes that one may introduce a (composite) /i^-transport 
operator along the piecewise A-admissible curve ip as follows: Va = 
i'^e)at-i°- ■ •°(V'i)aJ' where {ipi)al_i represents the /I'^-transport operator 
along the smooth A-admissible curve tpi, as defined in the previous 
section. If, for a given ip (and the corresponding control u), X{t) solves 
the equation D^{u,X)(t) = 0, it then follows from the definition that 
V'^(X(a))=X(6). 

We shall prove below that a piecewise A-admissiblc curve tjj, with 
vi o ^l} = c, can always be considered as being induced by smooth 
A-admissible curves : li = [ai-i,ai] — J^u of the form ipi{t) = 
j^ai{c{t)), for some local section cjj of u. Using this property we then 
know from above that (V-'Oai-i = '^'Pai-ai^i^ with {(f>l} the flow of 
T o p o (Tj. Denoting the composite flow of the ordered set (T o p o 
ae, . . . ,T o p o ai) by $ and using the shorthand notation introduced 
in Section 2, we find that the (composite) ^'^-transport operator ■^^ is 
given by 

Indeed, a straightforward computation gives: 

^^(X(a)) := {At_,o...oii^,)l^iX{a)) 

= T4>l-a,.,o...oT4>l^_^iX{a)) 
= T$^(X(a))=X(6). 

In order to prove that any (piecewise) A-admissible curve can be 
written as a concatenation of smooth A-admissible curves of the form 
j^a o c, we shall prove that any smooth A-admissible curve ijj whose 
image is entirely contained in a coordinate chart, is of that form. The 
general result then follows by a similar argument as the one applied 
in Section 2 (when proving that the base curve of any control is a 
concatenation of integral curves of vector fields in V). So, assume ip 
can be written in coordinates as tl^it) = {t,x'^(t),u"'(t),u1{t),uf{t)) for 
alH G / = [a, b]. Since ■0 is A-admissible, we then have that 

ii'^it) =u^{t)+u^{t)x\t) and ±\t) = p\t,x\t),u''{t)). 

Consider now a smooth extension tp^t) = {t,x^{t),u"'{t),u^{t),u1^(t)) of 
tp, defined on an open interval / containing /, such that Im ip is still 
contained in the same coordinate chart, with 'ip{t) = ip{t) for all tel. 
Next, we can construct a local section a oi v, defined on t~^{I), as 
follows: a{t,x) = {t,x,a''{t,x)), with a"'{t,x) = u''{t)+u'}{t){x' -x'{t)). 
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For each fixed t E I we find that 

<(i), 

and, hence, we have that j^a{t,x{t)) = ip{t) for all t E I, which is 
precisely what we wanted to prove. 

We have seen that, given a piecewise A-admissible curve tp in J^v, 
with continuous piecewise base curve = 1^10^ and corresponding 
control u = o i/j, we can regard the equation D^{u,X){t) = 
as a diff'erential equation for the component X of the curve {u, X) 
in U Xm Vt that is /I'^-transported along ip. Returning to the given 
geometric control structure {T,p,iy), we shall now explain the role of 
the /I'^-transport operator in determining the vertical variational cone 
associated to a composite fiow $ and composite fiow parameter T 
induced by an ordered set of vector fields of the form (T o p o cr), for 
some a G r(z/). 

Given any control u : [a, b] U, with base curve c = vou. In Section 
2 we have seen that c is induced by the composite flow <& of an ordered 
set of vector fields belonging to the family V given by (1), say (T o p o 
ae, . . . , Topoai), where crj G r(z/), and let the composite flow parameter 
be T = {ai — a^-i, . . . ,ai — oq), with a = oq < 04 < . . . < 6 = a^. If 
we put c(ao) = m and c(a^) = m', then the vertical variational cone 
Vm'Rmi^, T) is completely determined by the piecewise A-admissible 
curve tJj in J^u that is induced by the smooth curves ipi(t) = j^ai{c{t)). 
Indeed, it follows from Definition 11 and from the above analysis, that 
any element of Vm'Rmi^, T) can be written as a linear combination of 
/t'^-transported vertical tangent vectors along ip, i.e. 

Yi eV,S'> 0,t' e]a,b] 

Roughly speaking, one can say that the (piecewise) A-admissible 
curve ijj corresponding to the control u, contains sufficient information 
regarding the sections ai in order to determine the vertical variational 
cone Vm'Rm- Prom now on we shall therefore write Vyn'Rmi'4') if 
want to emphasise that the vertical variational cone can be generated 
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by the /I'^-transport operator along the (piecewise) A-admissible curve 

For later use we will need an extension of the action of the A- 
derivative D to 'vertical' forms, belonging to the dual of V(z^). Consider 
the fibred product bundle U Xm V*t with corresponding projections 
pI : UxmV*t -^U,P2 ■■ UxmV*t V*t, such that uopl = r|^oy*r. 
Here rj^ : V*t M denotes the dual bundle of Vt M. The dual 
module of V{v) is then given by the set 

V*(i/) = {^:U ^V*t\ Tl^{r]{u) = u{u) for all u G U}. 

Obviously, we have V*{iy) = T{pl). Given t] G V*(i/) and Z G V{u), the 
natural pairing V(z^), {f]-,Z) defines a function on U. In particular, for 
jl^a G J^u, with a{m) = u, we note that P2(^(^)) ^-^^d p2{Z{u)) belong 
to the dual linear spaces V^r and V^r, respectively. By requiring that 
for any fixed G V*(i/), the relation 

(D,. ,r7,Z(n)) = k{jla){{'n,Z)) - {r){u),Dj,^,Z), (6) 

should hold for all Z €V{u), the element Dji^^rj G {pl)-^{u){= F^r) 
is uniquely determined. 

Consider a piecewise A-admissible curve : [a,b] ^ J^u with con- 
tinuous piecewise projection c = i/ioip on M and corresponding control 
u = 1^1,0 o ip : [a,b] —>■ U. Take a continuous piecewise section r]{t) of 
V*T along c{t) such that {u, f}){t) defines a section of p^ along the curve 
u{t). We then have the following property. 

LEMMA 23. D^{u,fj){t) = tff r]{t) = ((^*)-i)*(r/(a)) for all t G 
[a,b]. 

Proof. Fix some to € I and take an arbitrary Xq G V^(fj,)r. Using 
the /I'^-transport operator along ^, we can then construct a continuous 
piecewise section X{t) of Vt along c{t) by X{t) = '0a((V'a )~'^(^o))- 
Note that X{to) = Xq. Then, with (6) we obtain 



{D^{u,r])ito),{uito),X{to))) = ^ 



{v{t),x{t)) 

to 

{{u{to),Wo)),D^{u,X){to)). 



Now it follows from the definitions that both terms on the right-hand 
vanish separately if we take rj(t) = ((-0* )~^)*(r/(a)). Indeed, with this 
choice we have {f]{t), X{t)) = {rj{a) , {Xq)) = const., and the 

equation D^{u, X){to) = holds in view of the definition of X{t). The 
remainder of the proof then follows from the uniqueness of solutions of a 
system of ordinary differential equations with given initial conditions. □ 
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The A-derivative D will play a crucial role in the proof of the Maximum 
Principle in the next section. In the following remark we briefly explain 
how some of the basic ideas in the treatment of the Maximum Principle 
in [7] can be related to our work. 

REMARK 24. The discussion of the Maximum Principle can be devel- 
oped for controls that verify the weaker assumption of being measurable 
and bounded, instead of (piecewise) smooth (see, for instance, L.S. Pon- 
tryagin et al. [7]). Using local coordinate expressions, we will roughly 
sketch how the smoothness conditions we have imposed on controls 
can also be relaxed within our framework. The local expressions for 
the equation {u, X) (t) = reads 

The condition that the functions u"(t) and uf{t) be measurable and 
bounded, suffices to obtain a solution of this equation and, subse- 
quently, to introduce a suitable notion of transport operator. This 
observation can be translated into our geometric framework as follows. 
Consider the set V^l' := UmeM{TmCr\VT '■ ^mT T„(m)U \ cr G T(iy)}. 
It can be proven by standard arguments that V^v is an affine bundle 
over U, with coordinates {t,x'■,u'^,u^) (see, for instance, [9]). Note that 
there exists a natural projection fi : J^u V^v, locally expressed by 
{t, x\ u"-, uf, uf) t-^ [t, X*, ti", u"). From the fact that the coefficients P^ 
of h'^ do not depend on the tt" (see the proof of Proposition 6.3) it easily 
follows that the A-derivative only depends on fioip. Now, since ip was 
assumed to be A-admissible, i.e. A(^/;) = n, the smoothness condition 
on u could not be relaxed. However, the curve ^ = /lo^ does not have 
to satisfy this condition, implying that the smoothness condition can 
be relaxed without losing the notion of derivative acting on sections 
of Vt along c. We can therefore conclude that, in order to define a 
vertical cone of variations associated with a measurable and bounded 
control u, we must fix a curve ijj in V^u. If one works in a coordinate 
chart, a natural choice of is the curve ^{t) = (t, c''{t),u"'it),uf{t)) 
with uf{t) = 0. The equations of the derivative associated with 

then reduce to X^^it) = ^{t,c> {t),u°-{t))X\t). These equations are 
precisely the "variational equations" introduced in [7, p79]. By fixing 
the coordinate chart, one can fix the section (T"(t,a:) = u"(t) and the 
curve il){t) = {t, cP {t),u"'{t),0), implying that, respectively a fixed ver- 
tical cone of variations and a fixed derivative associated with tp can be 
defined. This essentially establishes the link between our approach and 
the one followed by L.S. Pontryagin et al.. 



maxacta.tex; 1/02/2008; 11:39; p. 32 



Geometric aspects of the Maximum Principle 



33 



7. The Maximum Principle and extremal controls 

We will now derive the Maximum Principle by combining the tools 
developed in Section 6 and the necessary conditions for optimal controls 
derived in Section 4. 

Let (r, z^, /3, L) denote an arbitrary geometric optimal control struc- 
ture, with extended geometric control structure (t,T','p). In view of the 
structure of the bundle r : M(= M x M) M, {m, J) t-^ T{m), 
it is easily seen that the bundle of vertical tangent vectors Vt is 
isomorphic to Vt x M^. Similarly, the bundle V*t can be identified 
with V*T X iR^. In particular, given a point (m, J) G M(= M x M), a 
CO- vector rj G j^T can always be represented by a pair {r]m,i]j) for 
some r)m € V^r and r/j G M. 

Before proceeding, we still have to introduce a few additional con- 
cepts. First, we recall that the dual of a convex cone C in a vector 
space V is defined by the set C* = {a G V* | {a,v) < 0, V-y G C}. 
A general result that will be used later on, tells that C* = (cl(C))* 
and (C*)* = cl(C), where cl denotes the closure of C in V (see e.g. [4] 
for a proof). Finally, for any v £ V, the half-ray through en v, i.e. 
{w \ w = rv, Vr > 0}, will be called the 'cone generated by v\ and 
denoted C{v). 

Another concept that we will need, is that of a 'multiplier of a 
control'. For that purpose, we first construct a 1-parameter family of 
closed two- forms on U Xm V*t. Let lj be the closed two- form on the 
fibred product U Xm T*M, obtained by pulling back the canonical 
symplectic form on T*M by the projection U Xm T*M — > T*M. Next, 
for any real number A we can define a section Hx of the fibration 
U Xm T*M U Xm y*T in the following way. Take u G Um, V £ ^nJ 
and put Hx(u,r]) = {u,a), where a G T^M is uniquely determined 
by the conditions {a, T{p{u))) + \L{u) = and a projects onto 77. 
The mapping Hx is smooth, as can be easily seen from the follow- 
ing coordinate expression: putting u = {t,x'^,u"') and rj = pidx'^^^, a 
straightforward computation gives 

Hx{t, x\ u\pi) = (t, x\ u\ -p\t, x\u'')pi - XL{t, x\u^),pi) . 

We can now use Hx to pull-back the closed two-form cD to a closed two 
form on U Xm V*t, which will be denoted by ujx- Herewith, we can 
now introduce the following definition of a multiplier. 

DEFINITION 25. Given a control u : [a, b] ^ U, a pair (r/. A) consist- 
ing of a continuous piecewise section rj of V*t along c = u o u and a 
real number X, is called a multiplier ofu if the following conditions are 
satisfied: 
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1. i(^u{t),fi{t))^\ = on every smooth part of the curve {u{t),rj{t)), 

2. given any to € [a,b], and putting Hx{u{to),7]{tQ)) = {u{tQ),ao), 
the function u' i— > {ao,T{p{u'))) + \L{u'), defined on i'~^{c{to)), 
attains a global maximum for u' = u{to), 

3. (r]{t), A) / (0, 0) for all t G [a, b]. 

Returning to the geometric optimal control structure (r, v, p, L), let 

u{t) = {u{t), Jo + jit'*^) represent a control in the extended geometric 
control setting, defined on an interval [a, 6]. As before, c will denote the 
base curve of u in M (cf. Section 4), and we put c(a) = m, c(6) = n. The 
bundle map (4) associated to the extended geometric control structure 
will be written as A. Given an arbitrary piecewise A-admissible curve ?/' 
in J^V projecting onto u, we will prove in the following theorem that the 
dual of the vertical variational cone V(n,Jo+./(«))-^{"i,./o)(V') o^^Y depends 
on u. 

THEOREM 26. Let rj^ = {vo, Ao) G y*„,jo+j{u)f' ™ih fJo + 0. Then 

we have thatfjo G (V(^n,Jo+J{u))R{m,Jo){'4')) if and only if there exists 
a section rj of V*t along c, with r){b) = r)o, such that the pair {ri, Aq) is 
a multiplier of u. 

Proof. We prove that any % / in the dual of the vertical varia- 
tional cone determines a multiplier for u. The converse property will 
then simply follow by reversing the arguments. 

Let rj(t) denote the unique continuous piecewise curve in V*t satis- 
fying the equation D:^{u,fj){t) = 0, with f}{b) = fjQ. This implies that 

rj{t) = i^tTiVo)- We can write ^(t) asrj{t) = (r/(t), (Jo + r?j(t))), 
where r]{t), resp. (Jq + J'u"''^\')]j{t)) arc curves in V*t, resp. M^, such 
that ri{b) = rjo and r]j{b) = Xq. We will now prove that (??, Aq) is a 
multiplier of u. 

First of all, it is easily seen that condition (3) of Definition 25 
holds. In order to prove that (1) and (2) of the definition hold, take an 
arbitrary to G]ct)^] and u' G ^Jc(to) arbitrary. Then, we find that 

to (T(p(n', Jo + J-^'*"^)) - T{p{u{to)))) G V^n,Jo+J(u))Rim,Jo)&). 

By contracting this tangent vector with % £ (^(n,Jo+J'{u))-R(m,Jo)(V^)) > 
and taking into account the definition of the dual of a cone, we obtain 
the following inequality: 

{riito),T{p{u')) - T(p(«(to)))) + Vj{to)iLiu') - L(n(to))) < 0. (7) 



maxacta.tex; 1/02/2008; 11:39; p. 34 



Geometric aspects of the Maximum Principle 



35 



This holds for any to b] and any u' G ^c(to)- Note that this inequal- 
ity is also valid for = o- It suffices to consider a local trivialization 
of U and to interpret the left-hand side of the above inequality as a 
function of to, which is clearly continuous in a neighborhood of a. In 
particular, we deduce from the above that the function 

u' ^ {rj{to),T{p{u')) - T{p{u{tom + Vj{to){L{u') - L{u{to))), 

defined on C/c(to), admits a global maximum at u' = u{to). In local 
coordinates this means, in particular, that we have: 

m{to)^{u{to))+iij{to)-^{u{to)) = 0, (8) 

and this holds for all G [a, 6]. These relations are used in the following 
to prove that the function T/,y(t) is constant and that r/(f) satisfies 
condition (1) of Definition 25. The coefficients of the linear A-lift h 
are related to the coefHcients of in the following way (using a slight 
abuse of notation): 



p' pj p" 



Herewith, the differential equations for r]{t) and r]j{t) become, on every 
smooth part of rj: 

rjj{t) = -rjr],(t)-Tj7jj{t), 
m = -rlrij{t)-Tirjj{t). 

Taking into account the relations (8), which hold for all values of to £ 
[a,b], it is easily seen that fjj{t) = and, hence, tjj is a constant 
function, with r]j{t) = Aq. Moreover, the functions r]i(t) satisfy: 

Putting, in local coordinates, hxg{u,ri) = p'^{u)r]i + XoL{u), the two- 
form (jJ\q reads: cjaq = —dhx^ Adt + dpi A dx*. After some tedious, but 
straightforward calculations it follows that the condition i(u{t),ri{t))^Xo — 
is equivalently to 
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d_ 

dt 



and it is easily seen that the curve r]{t) defined above, satisfies these 
equations. This shows that condition (3) of Definition 25 is satisfied. It 
finally remains to prove that also the second condition for a multiplier 
holds. 

Consider the section Hx^ : U XmV*t — > U XmT*M and let us write 
for any to G [a,b], Hx^^{u{to),r]{to)) = (u{to) , a{to)) . Substituting this 
into (7), and recalling that r]j{t) = Aq, we obtain: 

{a{to),T{p{u'))} + XoL{u') (= {a{to), T(p(u(to)))) + AL(u(to))) < 0, 

proving that (2) is satisfied. This completes the proof that {r]{t),rij = 
Ao) is indeed a multiplier. □ 

As a consequence of the above theorem, the dual of the vertical vari- 
ational cone, in the extended setting, only depends on the control u 
and, hence, this is also true for the closure of this cone. Moreover, 
as an interesting side result we obtain that the closure of the vertical 
variational cone VnRm also depends on u only. Indeed, using the same 
techniques as in the above theorem it is easily seen that every mul- 
tiplier with A = 0, determines an element of the dual cone of VnRm, 
and vice versa. To simplify the notations we put m = (m, Jq) and 
n = (n, Jo + Jiu)). Recall Corollary 15, which is reformulated in the 
following way and leads us to a more familiar version of the maximum 
principle. 

CORALLARY 27. Assume that m n and that u is optimal. Then 

there exists a multiplier (77, A) with A < 0. 

The following definitions are well known from the literature. 

DEFINITION 28. A control u, with m ^ n is called an extremal if 
there exists a multiplier {r]{t),X) for which A < 0. An extremal is called 
normal, resp. abnormal, if there exists a multiplier (r/(t),A) for which 
X<0, resp. A = 0. 

An extremal is thus equivalently defined as a control for which the 

closed cone cl{VnRm) does not contain —d/dJ in its interior. Note 
that an extremal can be simultaneously abnormal and normal. We say 
that an extremal is strictly abnormal if it is abnormal but not normal. 
The following proposition gives necessary and sufficient conditions for 
a control to be an abnormal extremal or a strictly abnormal extremal. 
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PROPOSITION 29. A control is an abnormal extremal iffd{VnRm) 
VnT. A control is a strictly abnormal extremal iff —-§j is in the border 

0fc\{VnRrn)- 

Proof. The first statement follows from the fact that every element 
in the dual cone (cl(V^i?m))* corresponds to a multiplier with A = 
(see above). 

An extremal is strictly abnormal iff every element ?7o {'^^{YnRm))* 
satisfies (r/o)j > (by definition). Using the definition of the dual cone 
and the fact that C** = cl(C) for an arbitrary convex cone C, we obtain 
that — ^ is contained in c\{VnRYn)- On the other hand, since u is an 
extremal we know that — ^ is not contained in the interior of the cone 

d{VnRrn)- □ 

It should be noted that the condition VnRm 7^ docs not depend 
on the cost function L. This justifies the notion of an abnormal ex- 
tremal: u satisfies the necessary conditions for being a optimal control 
with respect to the cost L, however these conditions do not depend 
on L. The above result can be intuitively interpreted as follows: a 
control u is an abnormal extremal iff the family of vector fields V 
does not supply enough "vertical" variations to the control u. In the 
case of strictly abnormal extremals the maximum principle fails in the 
sense that Corollary 10 only gives information on those vectors lying 
in the interior of a variational cone, and not on those belonging to the 
boundary. 
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